Abstract

We present a modified version of Buczolich and Mauldin's proof that the sequence of square numbers is
universally $L^1$-bad. We extend this result to a large class of sequences, including the $d$th powers and the
set of primes; furthermore, we show that any subsequence of the averages taken along these sequences is
also universally $L^1$-bad.

1 Introduction 

Let $(X, \mathcal{F}, \mu, \tau)$ be a dynamical system. For a sequence of natural numbers $\{n_k\}$ and any $f \in L^1(X)$, we can
consider the subsequence average

$$A_N f(x) := \frac{1}{N} \sum_{k=1}^{N} f(\tau^{n_k} x).$$

By analogy with Birkhoff's Pointwise Ergodic Theorem, we will examine the a.e. convergence or divergence
of $A_N f(x)$ as $N \to \infty$.

We say that $\{n_k\}$ is universally $L^p$-good if for every dynamical system $(X, \mathcal{F}, \mu, \tau)$ and every $f \in L^p(X, \mu)$,
$\lim_{N \to \infty} A_N f(x)$ exists for almost every $x \in X$. We say that $\{n_k\}$ is universally $L^p$-bad if for every non-atomic
ergodic dynamical system $(X, \mathcal{F}, \mu, \tau)$, there exists an $f \in L^p(X, \mu)$ such that the sequence $\{A_N f(x)\}_{N=1}^{\infty}$
diverges on a set of positive measure in $X$. Finally, we say that $\{n_k\}$ is persistently universally $L^p$-bad
if for every non-atomic ergodic dynamical system $(X, \mathcal{F}, \mu, \tau)$ and every infinite $S \subset \mathbb{N}$, there exists an
$f \in L^p(X, \mu)$ such that the sequence $\{A_N f(x)\}_{N \in S}$ ($N$ taken in increasing order) diverges on a set of positive
measure in $X$.

Among the classical results in this topic, Bourgain [2] proved that (the integer part of) any sequence of
polynomial values is universally $L^p$-good for any $p > 1$, and Bourgain [1] and Wierdl [11] showed that the
same is true of the sequence of prime numbers. For these sequences, the Banach principle of Sawyer [5]
implies that pointwise convergence of $A_N f$ for all $f \in L^1$ depends only on the validity of a weak maximal
inequality

$$\left\| \sup_{N \in \mathbb{N}} |A_N f(x)| \right\|_{1,\infty} \le C \|f\|_1 \quad \forall f \in L^1(X). \tag{1.1}$$

The Conze principle [6] allows the transfer of such an inequality (with the same constant) from any ergodic
dynamical system $(X, \mathcal{F}, \mu, \tau)$ to any other dynamical system. Therefore, one of these sequences would be
universally $L^1$-good if and only if there were some fixed $C > 0$ such that (1.1) held for every dynamical
system $(X, \mathcal{F}, \mu, \tau)$, and it would be universally $L^1$-bad otherwise.
This question of pointwise convergence for subsequence averages of $L^1$ functions remained open for virtually
all sequences of interest, including all polynomials of degree $\ge 2$ and the sequence of primes, until
Buczolich and Mauldin [3] [1] proved that $\{k^2\}$ is in fact universally $L^1$-bad.

In this paper, we adapt and extend the construction in [3] to prove the following theorem: 

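Theorem 1.1. Let $n_k = k^d$ for some $d > 1$, or $n_k =$ the $k$th prime number. Given any $C > 0$ and any
infinite set $S \subset \mathbb{N}$, there exists a dynamical system $(X, \mathcal{F}, \mu, \tau)$ and an $f \in L^1(X)$ such that

$$\left\| \sup_{N \in S} \left| \frac{1}{N} \sum_{k=1}^{N} f \circ \tau^{n_k} \right| \right\|_{1,\infty} > C \|f\|_1.$$

As discussed above, this implies

Corollary 1.2. The sequence of $d$th powers ($d > 1$) and the sequence of primes are persistently universally
$L^1$-bad.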

This exposition is self-contained, with the exception of the number-theoretic results of Hooley [8] and
Granville and Kurlberg [7].

A few words on the structure of this paper: In Section 2, we present a heuristic version of the argument,
in the case of the squares. Then in Section 3, we express the general form of our result (Theorem 3.1) and
prove that its conditions are indeed satisfied by the $d$th powers and the sequence of primes. In Section 4,
we present the main inductive step (Proposition 4.1), show that it implies Theorem 3.1, and explain the
structure of the induction.

In Sections 5-7, we construct the various objects of the succeeding inductive step and prove several necessary
lemmas about them. Section 8 brings these parts together and proves that the properties claimed in
Proposition 4.1 do indeed hold for this next step, completing the proof of Theorem 3.1. In Section 9, we
retrospectively explain the purpose of several objects and lemmas in this intricate proof.

Our notation will rarely distinguish between $\mathbb{Z}_N$ (a probability space with the measure-preserving
transformation $\tau x = x + 1 \bmod N$) and $\mathbb{Z}$. Sets and functions on $\mathbb{Z}_N$ will correspond to $N$-periodic sets and
functions on $\mathbb{Z}$, and any object on $\mathbb{Z}_N$ is understood to represent an object on $\mathbb{Z}_{MN}$ for any $M \in \mathbb{Z}^+$.

Furthermore, we let $\mathbb{P}$ denote the uniform probability measure on $\mathbb{Z}_N$, and $\mathbb{E}X$ the expected value of a
random variable $X : \mathbb{Z}_N \to \mathbb{R}$. Note that the values of $\mathbb{P}$ and $\mathbb{E}$ are unchanged when we consider an
$N$-periodic set or function as an object on $\mathbb{Z}_{MN}$ instead, so that we may use $\mathbb{P}$ and $\mathbb{E}$ freely without keeping
track of $N$. We will use $X \overset{d}{=} Y$ to denote that two random variables $X$ and $Y$ (not necessarily on the same
probability space) have identical distributions.

Finally, we will use both subscripts and superscripts on certain functions $f_h^L, g_h^L, X_h^L, Z_h^L$ and certain sets
such as $\Lambda_q^\gamma$. To prevent these from being confused with exponential notation, we note here that such superscripts
on these objects will not denote exponents; we will therefore write the square of $X_h^L$ as $(X_h^L)^2$, always with
parentheses.

2 Outline of the Argument for the Squares 

Here we will present a heuristic outline of the argument in the original case $n_k = k^2$, before introducing the
necessary complications (exceptional sets and the like). We therefore ask the reader's patience with these
claims, some of which are not technically true; the argument presented in Section 3 and thereafter will be
rigorous.

Buczolich and Mauldin's proof in [3] essentially boils down to three key insights. The first is that in order to
prove that $\{k^2\}$ is universally $L^1$-bad, it suffices to prove the existence of what they term $(K, M)$ families
(see footnote 1) for arbitrary $K, M \in \mathbb{N}$.

Definition Given $K, M \in \mathbb{N}$ and a measure-preserving system $(\Omega, \mathcal{F}, \tau, \mathbb{P})$, a $(K, M)$ family on $\Omega$ consists
of the following:

• $f_1, \dots, f_K \in L^1(\Omega)$ with $f_h \ge 0$ and $\mathbb{E} f_h \le 1$

• $X_1, \dots, X_K$ pairwise independent random variables on $\Omega$ with $\mathbb{E} X_h \ge M$ and $\mathbb{E}(X_h)^2 \le C_M$

• A measurable function $Q_x : \Omega \to \mathbb{N}$ such that for a.e. $x \in \Omega$,

$$\frac{1}{Q_x} \sum_{k=1}^{Q_x} f_h(\tau^{k^2} x) \ge X_h(x) \quad \forall 1 \le h \le K. \tag{2.1}$$

Note that for each $x \in \Omega$, $Q_x$ does not depend on $h$.

The point of constructing a $(K, M)$ family is that, while a single $X_h$ may have a weak $L^1$ norm no greater
than the $L^1$ norm of $f_h$, an average of pairwise independent random variables with uniformly bounded
variance is subject to the Weak Law of Large Numbers. Thus for some large $K$, the average $\frac{X_1(x) + \dots + X_K(x)}{K}$ will
be at least $\frac{M}{2}$ on a set of probability at least $\frac{1}{2}$, giving a weak $L^1$ norm of at least $\frac{M}{4}$, while the $L^1$ norm of
the average $\frac{f_1 + \dots + f_K}{K}$ remains $\le 1$.

Therefore if we have $(K, M)$ families for all $K, M \in \mathbb{N}$, we can construct a dynamical system and a function
that violate any given weak (1,1) maximal inequality.
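The Weak Law step above needs only pairwise independence, because Chebyshev's inequality uses nothing beyond the additivity of variances. The following is a minimal sketch of that fact on a toy example; the XOR family and all function names are illustrative assumptions and are not objects from the paper.

```python
from fractions import Fraction
from itertools import product


def xor_family():
    """Four equally likely outcomes of (X1, X2, X3), where X3 = X1 XOR X2.

    Illustrative toy family: pairwise independent, not mutually independent.
    """
    return [(a, b, a ^ b) for a, b in product((0, 1), repeat=2)]


def is_pairwise_independent(outcomes):
    """Check P(Xi = u, Xj = v) = P(Xi = u) P(Xj = v) for all pairs i < j."""
    n = len(outcomes)
    k = len(outcomes[0])
    for i in range(k):
        for j in range(i + 1, k):
            for u, v in product((0, 1), repeat=2):
                joint = Fraction(sum(1 for o in outcomes if o[i] == u and o[j] == v), n)
                p_i = Fraction(sum(1 for o in outcomes if o[i] == u), n)
                p_j = Fraction(sum(1 for o in outcomes if o[j] == v), n)
                if joint != p_i * p_j:
                    return False
    return True


def variance(values):
    """Variance of a uniformly weighted list of values."""
    n = len(values)
    mean = Fraction(sum(values), n)
    return sum((Fraction(v) - mean) ** 2 for v in values) / n


def chebyshev_bound(var_single, k, t):
    """Bound on P(|average of k pairwise independent copies - mean| >= t)."""
    return Fraction(var_single) / (k * Fraction(t) ** 2)
```

The variance of the sum of the three XOR variables equals the sum of their variances, which is exactly what the Chebyshev bound for the average relies on.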

The second insight is that we can inductively construct a $(K, M)$ family on the probability space $\mathbb{Z}_N$ with
the shift operator $\tau x = x + 1 \bmod N$, for some large $N$ depending on $K$ and $M$. We will use an inner
induction: given a $(K - 1, M)$ family we will construct $f_K$ and $X_K$ in stages $f_K^L, X_K^L$, so that we have all
the required properties except for $\mathbb{E} X_K \ge M$, which we replace with $\mathbb{E} X_K^{L+1} \ge \mathbb{E} X_K^L + \epsilon$. Then for $L$ large
enough, we will have a $(K, M)$ family as desired.

The main obstruction to this approach is the difficulty of maintaining pairwise independence; when we
alter $f_K^L$ and $X_K^L$, we must at the same time alter $f_1^L, \dots, f_{K-1}^L$ and $X_1^L, \dots, X_{K-1}^L$ in order to sustain this
property. We do this by taking into account properties of the distribution of the squares in residue classes.

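Definition Given $\gamma > 0$, and given $q \in \mathbb{N}$ squarefree and odd with $\kappa$ prime factors, consider the following
subsets of the integers:

$$\Lambda_q := \{x \in \mathbb{Z} : (x, q) = 1, \ \exists k \in \mathbb{Z} \text{ such that } x \equiv k^2 \bmod q\}$$

$$(-\Lambda_q)^\gamma := -\Lambda_q + (0, \gamma 2^\kappa)$$

It follows from elementary number theory that every $x \in \Lambda_q$ has exactly $2^\kappa$ square roots modulo $q$; and that
if $q = p_1 \cdots p_\kappa$ with $p_i$ large, then $\mathbb{P}(\Lambda_q) = \prod_{i=1}^{\kappa} \frac{p_i - 1}{2 p_i} \approx 2^{-\kappa}$. Clearly $\mathbb{P}((-\Lambda_q)^\gamma) \le \gamma$; it follows from the
result of Granville and Kurlberg [7] that $\mathbb{P}((-\Lambda_q)^\gamma) \approx 1 - e^{-\gamma}$.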
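The two elementary facts in this definition (the number of square roots and the density of the unit square residues) are easy to confirm by brute force for a small modulus. The sketch below does so; the function names are illustrative, and the modulus 105 = 3 · 5 · 7 is an arbitrary small example, far from the "large prime factors" regime the text needs.

```python
from fractions import Fraction
from math import gcd


def unit_square_residues(q):
    """Map each unit square residue mod q to its number of square roots in Z_q."""
    roots = {}
    for k in range(q):
        if gcd(k, q) == 1:
            r = (k * k) % q
            roots[r] = roots.get(r, 0) + 1
    return roots


def predicted_density(prime_factors):
    """Product of (p - 1) / (2p) over the distinct odd prime factors of q."""
    density = Fraction(1)
    for p in prime_factors:
        density *= Fraction(p - 1, 2 * p)
    return density
```

For q = 105 there are three prime factors, so each unit square residue should have 2^3 = 8 square roots, and the density of such residues should be (1/3)(2/5)(3/7) = 6/105.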

Having set $f_K^0 = X_K^0 = 1$, at each step in $L$ we will take some highly composite $q = q_L$ and set

$$f_K^{L+1}(x) := e^{\gamma} f_K^L(x) 1_{\mathbb{Z} \setminus (-\Lambda_q)^\gamma}(x) \tag{2.2}$$
$$X_K^{L+1}(x) := e^{\gamma} X_K^L(x) 1_{\mathbb{Z} \setminus (-\Lambda_q)^\gamma}(x) + c 1_{\Psi_q}(x), \tag{2.3}$$

for some $\Psi_q \subset (-\Lambda_q)^\gamma$ with $\mathbb{P}(\Psi_q) \ge c\gamma$. To maintain the property (2.1), for $x \notin (-\Lambda_q)^\gamma$ we will keep the
same length of averaging $Q_x$ we used for $f_K^L$; but for $x \in (-\Lambda_q)^\gamma$ we will redefine $Q_x$ to be a multiple of $q$.
We will define the set $\Psi_q$ such that if we add a generic square number to an element of $\Psi_q$, we have a good
chance of landing in the support of $f_K^{L+1}$; the result of [7] ensures that this is possible.

We can ensure that $X_K^{L+1}$ remains independent from our original $(K - 1, M)$ family by choosing $q$ relatively
prime to the period of that family; note that $\mathbb{E} X_K^{L+1} = \mathbb{E} X_K^L + c^2 \gamma$. However, we are not quite out of
the woods: for $x \in (-\Lambda_q)^\gamma$ we are now using different values of $Q_x$ with $f_K^{L+1}$ than with our $(K - 1, M)$ family.

1 Buczolich and Mauldin define these families with many more conditions, in the fashion that we will define our inductive
Step $(K, M, L)$ in Proposition 4.1, but the definition here will suffice for heuristic purposes.

This defect is repaired by the third insight of [3]: using the inductive hypothesis, we take a different $(K - 1, M)$
family $\{g_1^M, \dots, g_{K-1}^M, Z_1^M, \dots, Z_{K-1}^M, Q'_x\}$, where the lengths of averaging $Q'_x$ are relatively prime to $Tq$; we
may then "restrict" it to $(-\Lambda_q)^\gamma$ by taking

$$\tilde{g}_h(x) := q 2^{-\kappa} g_h^M(x) 1_{(0, \gamma 2^\kappa) + q\mathbb{Z}}(x)$$
$$\tilde{Z}_h(x) := Z_h^M(x) 1_{(-\Lambda_q)^\gamma}(x)$$
$$\tilde{Q}_x := q Q'_x.$$

This preserves the property (2.1) on $(-\Lambda_q)^\gamma$ if we assume that

$$g_h^M(\tau^{Q'_x} y) = g_h^M(y + Q'_x) = g_h^M(y) \quad \forall y \in [x, x + (q Q'_x)^2], \tag{2.4}$$

since if $x \in (-\Lambda_q)^\gamma$, then $|\{k \in [1, q Q'_x] : x + k^2 \in (0, \gamma 2^\kappa) + q\mathbb{Z}\}| \ge 2^\kappa Q'_x$ and this set is equidistributed
modulo $Q'_x$. This implies that for $x \in (-\Lambda_q)^\gamma$,

$$\frac{1}{\tilde{Q}_x} \sum_{k=1}^{\tilde{Q}_x} \tilde{g}_h(x + k^2) = \frac{1}{q Q'_x} \sum_{k=1}^{Q'_x} 2^\kappa \cdot q 2^{-\kappa} g_h^M(x + k^2) \ge Z_h^M(x) = \tilde{Z}_h(x).$$

(That is, averages of $\tilde{g}_h$ over long intervals of the squares look like averages of $g_h^M$ on short intervals of the
squares.) Then if for $h \le K - 1$ we let

$$X_h^{L+1}(x) := X_h^L(x) 1_{\mathbb{Z} \setminus (-\Lambda_q)^\gamma}(x) + \tilde{Z}_h(x)$$

and set the new $Q_x$ to equal $\tilde{Q}_x$ on the set $(-\Lambda_q)^\gamma$, we find that we have nearly preserved the properties of
the family we began with. Thus we may iterate the inner inductive step (to which we must of course add a
version of (2.4)) and thereby construct a $(K, M)$ family.



3 Main Theorem 

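Definition For a set $\Lambda \subset \mathbb{Z}_t$, we can define

$$\mathbb{P}(\Lambda) := \frac{|\Lambda|}{t}, \qquad s_t := \mathbb{P}(\Lambda)^{-1}, \qquad \Lambda^\gamma := \Lambda + (0, \gamma s_t) \subset \mathbb{Z}_t.$$

Remark Note that this turns $\mathbb{Z}_t$ into a probability space, that $s_t$ is the average spacing between elements
of $\Lambda$ in $\mathbb{Z}_t$, and that $\mathbb{P}(\Lambda^\gamma) \le \gamma$. If $\Lambda \subset \mathbb{Z}$ is periodic by $t$, we can consider it as a subset of $\mathbb{Z}_{nt}$ for any
$n \in \mathbb{Z}^+$, and we see that $\mathbb{P}(\Lambda)$ is independent of $n$.

The main result of this paper is the following: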
Theorem 3.1. Let $\{n_k\} \subset \mathbb{N}$ be an increasing sequence, and $\alpha, \beta > 0$. Say that for every integer $t > 1$, there
exists a set of residues $\Lambda_t \subset \{n_k + t\mathbb{Z} : k \in \mathbb{N}\} \subset \mathbb{Z}_t$ such that $\Lambda_{st} = \Lambda_s \cap \Lambda_t$ whenever $(s, t) = 1$, and that
there exist some auxiliary sequences $\{p_j\}$, $\{q_j\}$ of pairwise relatively prime positive integers such that

$$\mathbb{P}(\Lambda_{q_j}) \to 0, \tag{3.1}$$
$$\inf_j \mathbb{P}(\Lambda_{p_j}) > 0, \tag{3.2}$$
$$\epsilon_\gamma := \gamma - \liminf_{j \to \infty} \mathbb{P}(\Lambda_{q_j}^\gamma) = o(\gamma), \tag{3.3}$$

such that for all $\gamma > 0$ sufficiently small,

$$\liminf_{j \to \infty} \frac{|\{(u, v) \in \Lambda_{q_j} \times \Lambda_{q_j} : |u - v - w| \ge \gamma s_{q_j} \ \forall w \in \Lambda_{q_j}\}|}{|\Lambda_{q_j}|^2} \ge 5\alpha, \tag{3.4}$$

and for all $Q = p_{i_1} \cdots p_{i_k} q_{j_1} \cdots q_{j_l}$ with $i_1 < \dots < i_k$, $j_1 < \dots < j_l$,

$$\liminf_{N \to \infty} \frac{1}{N} |\{1 \le k \le N : n_k \equiv a \bmod Q\}| \ge \frac{\beta}{|\Lambda_Q|} \quad \forall a \in \Lambda_Q. \tag{3.5}$$

Then, given any $C > 0$ and any infinite set $S \subset \mathbb{N}$, there exists a probability space $(X, \mathcal{F}, \mathbb{P})$, a
measure-preserving transformation $\tau$ on $X$, and an $f \in L^1(X)$ such that

$$\left\| \sup_{N \in S} \left| \frac{1}{N} \sum_{k=1}^{N} f \circ \tau^{n_k} \right| \right\|_{1,\infty} > C \|f\|_1.$$

Remark (3.3) states that $\Lambda_{q_j}$ does not cluster too much in $\mathbb{Z}_{q_j}$, so that $\mathbb{P}(\Lambda_{q_j}^\gamma)$ is nearly $\gamma$, while (3.4) states
that the set of differences of elements in $\Lambda_{q_j}$ is not concentrated near $\Lambda_{q_j}$. (3.5) states that each point of
$\Lambda_Q$ is hit uniformly often by the sequence $\{n_k\}$, if $Q$ is any squarefree product of terms from the auxiliary
sequences $\{p_j\}$ and $\{q_j\}$. (The function $x \to Q_x$ we will later construct will take such products as its values.)



Claim. Theorem 3.1 implies Theorem 1.1.

Proof. For the sequence of primes, we take $\Lambda_t$ to be the set of integers relatively prime to $t$, then let $p_j$ be
distinct primes and $q_j$ be highly composite such that $q_j^{-1} \phi(q_j) \to 0$; since $\prod_{p \text{ prime}} \frac{p - 1}{p} = 0$, it is possible to
choose such $q_j$. The property (3.5) is clear from Dirichlet's theorem on arithmetic progressions.

For the sequence of $d$th powers, we will take $\Lambda_t$ to be the residues of $k^d \bmod t$ which are units mod $t$.
We will take $p_j$ to be distinct primes congruent to 1 mod $d$, and $q_j$ to be products of $j$ such primes. Note
that if $Q$ has $\kappa$ prime factors each congruent to 1 mod $d$, then every $x \in \Lambda_Q$ will have precisely $d^\kappa$ $d$th roots
in $\mathbb{Z}_Q$, and $\mathbb{P}(\Lambda_Q) = Q^{-1} \phi(Q) d^{-\kappa}$. If we choose all of the prime factors sufficiently large, we can ensure that
$Q^{-1} \phi(Q) \ge \frac{1}{2}$ for all squarefree products $Q$ of these sequences, so that (3.5) is satisfied.
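The counting behind (3.5) for $d$th powers can be checked directly on a small modulus: over a whole number of periods, each unit $d$th power residue is hit with frequency exactly $d^\kappa / Q$, which equals $(\phi(Q)/Q) / |\Lambda_Q|$. The sketch below assumes the illustrative values d = 3 and Q = 7 · 13 (both primes are congruent to 1 mod 3); the function names are hypothetical.

```python
from fractions import Fraction
from math import gcd


def dth_power_hits(d, Q, N):
    """For each unit residue a mod Q, count the 1 <= k <= N with k^d = a (mod Q)."""
    hits = {}
    for k in range(1, N + 1):
        if gcd(k, Q) == 1:
            a = pow(k, d, Q)
            hits[a] = hits.get(a, 0) + 1
    return hits


def euler_phi(Q):
    """Euler's totient, computed by brute force (adequate for small Q)."""
    return sum(1 for k in range(1, Q + 1) if gcd(k, Q) == 1)


def hit_frequencies(d, Q, periods):
    """Exact frequency with which each unit dth power residue is hit over full periods."""
    N = Q * periods
    return {a: Fraction(c, N) for a, c in dth_power_hits(d, Q, N).items()}
```

With these values the ratio $\phi(Q)/Q = 72/91$ plays the role of a lower bound for $\beta$ on this single modulus; the text needs it bounded below uniformly over all admissible $Q$, which is why the prime factors are taken large.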

To prove (3.3) and (3.4) for each of these cases, we will use some recent results on the distribution of
the residues $\Lambda_{q_j}$ (as the average spacing $s_{q_j} \to \infty$); roughly speaking, in each case they are distributed
locally like a Poisson process of rate $\mathbb{P}(\Lambda_{q_j})$. We begin by introducing some notation.

Definition Let $\{y_k\}_{k=1}^{N}$ be a strictly increasing sequence of real numbers in $[0, N)$, and consider it as a
subset of $\mathbb{T} = \mathbb{R}/N\mathbb{Z}$. For $E \subset \{1, \dots, N\}$, and $\theta > 0$, we define a probability measure and a cumulative
distribution function:

$$\mathbb{P}_k(E) := \frac{|E|}{N} \tag{3.6}$$
$$F(\theta) := \mathbb{P}_k(|y_{k+1} - y_k| > \theta). \tag{3.7}$$
Now if we consider the normalized set $s_q^{-1} \Lambda_q = \{y_i : 1 \le i \le |\Lambda_q|\} \subset [0, |\Lambda_q|)$ (where the $y_i$ are taken in
increasing order), we can examine the cumulative distribution function $F_q(\theta)$ defined as above. Hooley [8]
proves for the primes, and Granville and Kurlberg [7] prove for the $d$th powers (see footnote 2), that $F_q(\theta) \to e^{-\theta}$ pointwise
as $s_q \to \infty$ (for which reason they call these sets "Poisson distributed"). Thus

$$\mathbb{P}(\Lambda_q^\gamma) = |\Lambda_q|^{-1} \sum_i |y_{i+1} - y_i| \wedge \gamma = \int_0^\gamma F_q(\theta) \, d\theta \to 1 - e^{-\gamma}$$

so that we have (3.3). To prove (3.4) from this distributional fact, however, requires a little more work.
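The chain of equalities above rests on a counting identity: the size of a thickened set is the sum of the gaps truncated at the thickening width, which in turn is the "integral" of the gap tail counts. The sketch below checks a discrete analogue on $\mathbb{Z}_t$. It assumes half-open windows $\{0, \dots, w-1\}$ in place of the open interval $(0, \gamma s_t)$, and every function name is illustrative.

```python
import random
from math import gcd


def cyclic_gaps(points, t):
    """Gaps between cyclically consecutive elements of a subset of Z_t."""
    pts = sorted({a % t for a in points})
    n = len(pts)
    return [((pts[(i + 1) % n] - pts[i] - 1) % t) + 1 for i in range(n)]


def thickened(points, t, w):
    """The set points + {0, 1, ..., w - 1} inside Z_t."""
    return {(a + j) % t for a in points for j in range(w)}


def gap_tail(gaps, theta):
    """Number of gaps strictly larger than theta (an unnormalised F(theta))."""
    return sum(1 for g in gaps if g > theta)


def unit_squares(q):
    """Unit square residues modulo q."""
    return {(k * k) % q for k in range(q) if gcd(k, q) == 1}
```

For the unit squares modulo 105 and width 5, all three quantities equal 28; the same identity holds for an arbitrary random subset, since it does not depend on any arithmetic structure.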
Lemma 3.2. Let Q(9) := Piflj/j - y k - yi\ > 9Vk). For 9 > and J > 4, 

P* (c«(g)> - F(Jg)+ f 2g) " 1 ) > F(29)-F((J-2)9). 

Proof. Define A := {k : \y k+1 - y k \ < J9}; then P k (A) = 1 - F(J9). 
For z G [0,iV), let 

£ z := {i :3k e A : \ Vi - y k - z\ < 6} 
C z := {i:3k<?A:\ yi -y k -z\<6}. 

There are \B Z \ points j/j — z contained in the \A\ intervals [y k — 9,y k + 9]; thus it must be that for at 
least \B Z \ — \A\ of these points, their successors yi+\ — z also lie in the same interval, which implies that 
\Vi+i -Vi\< 20. Thus Fi(B z ) < P fe (A) + 1 - F{26) = 2 - F{J9) - F{29). 

Therefore, for any I, 

1-0(0) < Pi(Bk# A:\yi -y t -y k \< 0)+fi(Bk€ Alfa -y k -y t \< 6) 
= h{C m )+h{B yi ) 
< Fi(C yi ) + 2-F(J9)-F(29). 

But if i G C Vl and 29 < \y t - < ( J — 2)9, then 

Vi - Vi-i € (y k + 9, y k+ i - 9) 
so that \yi — y k — yi-i\ > 9Wk. This implies that 

Q(9) + Ci-i(O) > Q(9) +P l (C m ) > F(J9) + F(29) - 1 
whenever y t - G {29, {J - 2)9). 
Therefore 

p ; (cm > EM+I^t^ > ip,(o(A) + > + f(2A) - 1) 

> Pi(29<yi-yi-i<(J-2)e) 

> F (29) - F ((J -2)9). 

□ 

2 While Corollary 2 in that paper is stated for the entire set of $d$th power residues modulo $q$, we see that passing to the subset
$\Lambda_q$ does not change equation (1) and thus we may apply the same result.
Now if 7 < then e 27 > | and we can set J := =^ > 4; then Lemma 3.2 implies

Pi(0>l/8) > P; Ui> ~ 1 + e "7 + e 2T +Q ^ 1 )) 

> e -27_ e (2-^ + . (1) > 1 

for j sufficiently large, so that (|3.4[) holds with a = — □ 

Remark Note that (3.5) cannot be satisfied by polynomials other than $n_k = c_d k^d + c_0$, as can be seen from
Cohen's result [5] that if we fix $a \in \mathbb{Z}_p[x]$ of degree $d$ which is not of this type and consider $a(x) - y$ as
$y \in \mathbb{Z}_p$ varies, for some fixed proportion of $y$ this polynomial will have $d$ distinct roots, while for some fixed
proportion it will have 1 root. This prevents (3.5) from holding for sufficiently composite products. However,
the author expects that every polynomial of degree 2 or greater over $\mathbb{Z}$ should be persistently universally
$L^1$-bad, and that some clever variant of this argument should suffice to prove as much.



4 The Inductive Step 



For an inductive argument to work, we will have to specify additional properties of the objects we seek,
including a fixed distribution for the functions $X_h$.



Definition Given the sequences $\{p_j\}$ and $\{q_j\}$ as in Theorem 3.1, denote the set of their (squarefree)
products

$$\mathcal{Q} := \{p_{i_1} \cdots p_{i_k} q_{j_1} \cdots q_{j_l} : i_1 < \dots < i_k, \ j_1 < \dots < j_l\} \tag{4.1}$$

and the functions (depending on the infinite $S \subset \mathbb{N}$ in Theorem 3.1)

$$N(Q) := \max\left\{N : \exists a \in \Lambda_Q : |\{1 \le k \le N : n_k \equiv a \bmod Q\}| < \frac{\beta N}{|\Lambda_Q|}\right\} \tag{4.2}$$
$$\psi(n) := \inf\{s \in S : s > N(Q) \ \forall Q \in \mathcal{Q}, \ Q \le n\}. \tag{4.3}$$

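Definition For $0 < \gamma < 1$ and $0 < \alpha < 1$, we recursively define functions $Y_{n,\gamma,\alpha} : [0, 1]^{n+1} \to \mathbb{R}$, which we
may consider as random variables with the Lebesgue measure. Let $Y_{0,\gamma,\alpha}(x_0) = 1$, and

$$Y_{n+1,\gamma,\alpha}(x_0, \dots, x_{n+1}) := (1 - \gamma)^{-1} Y_{n,\gamma,\alpha}(x_0, \dots, x_n) 1_{[\gamma, 1)}(x_{n+1}) + \alpha 1_{[0, \alpha\gamma)}(x_{n+1}). \tag{4.4}$$

Note that

$$\mathbb{E}(Y_{n,\gamma,\alpha}) = \mathbb{E} Y_{n-1,\gamma,\alpha} + \alpha^2 \gamma = 1 + n \alpha^2 \gamma,$$
$$\mathbb{E}(Y_{n,\gamma,\alpha})^2 = (1 - \gamma)^{-1} \mathbb{E}(Y_{n-1,\gamma,\alpha})^2 + \alpha^3 \gamma \le (1 - \gamma)^{-n} (1 + \alpha^3) \le 2 (1 - \gamma)^{-n}.$$

Given the conditions of Theorem 3.1, we may assume that all the $p_j$ and $q_j$ are odd and that $\alpha$ is a dyadic
rational. Choose $\gamma_0 < 1/2$ such that for all $0 < \gamma < \gamma_0$, (3.3) and (3.4) hold and $\epsilon_\gamma < \alpha\gamma$; we will write
$\epsilon = \epsilon_\gamma$ from now on unless otherwise specified. We are now ready to state the inductive step: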
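The two moment formulas can be confirmed exactly for small parameters, because the recursion (4.4) makes each step multiply by $(1-\gamma)^{-1}$ with probability $1-\gamma$, jump to $\alpha$ with probability $\alpha\gamma$, and drop to 0 otherwise, independently of the past. The following sketch tracks the resulting distribution with exact rational arithmetic; the function names and the sample values of $\gamma$ and $\alpha$ are illustrative assumptions.

```python
from fractions import Fraction


def step_distribution(dist, gamma, alpha):
    """One application of the recursion (4.4) to a {value: probability} dict."""
    new = {}
    for value, prob in dist.items():
        outcomes = (
            (value / (1 - gamma), prob * (1 - gamma)),   # x_{n+1} in [gamma, 1)
            (alpha, prob * alpha * gamma),               # x_{n+1} in [0, alpha*gamma)
            (Fraction(0), prob * (gamma - alpha * gamma)),  # x_{n+1} in [alpha*gamma, gamma)
        )
        for v, p in outcomes:
            new[v] = new.get(v, Fraction(0)) + p
    return new


def y_distribution(n, gamma, alpha):
    """Exact distribution of Y_{n, gamma, alpha}, starting from Y_0 = 1."""
    dist = {Fraction(1): Fraction(1)}
    for _ in range(n):
        dist = step_distribution(dist, gamma, alpha)
    return dist


def moment(dist, k):
    """k-th moment of a finite distribution given as {value: probability}."""
    return sum(p * v ** k for v, p in dist.items())
```

For example, with $\gamma = 1/4$, $\alpha = 1/2$ and $n = 3$ the mean is $1 + 3 \cdot \frac{1}{4} \cdot \frac{1}{4} = \frac{19}{16}$, and the second moment stays below $(1-\gamma)^{-3}(1 + \alpha^3)$.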

Proposition 4.1 (Step $(K, M, L)$). Assume the conditions of Theorem 3.1. Given any dyadic rational
$0 < \gamma < \gamma_0$, any $K, M, L \in \mathbb{N}$ with $L \le M$, constants $A, \delta > 0$ with $\delta < 8\epsilon$, and an odd integer $D$, there exist
$T \in \mathcal{Q}$ and $R \in \mathbb{N}$ with $(T, D) = (R, D) = 1$ and the following objects:

(1) $f_1, \dots, f_K \in \ell^1(\mathbb{Z}_T)$ with $f_h \ge 0$ and $1 \le \mathbb{E} f_h \le (1 + 4\epsilon)^{(K-1)M + L}$.

(2) $X_1, \dots, X_K \in \ell^1(\mathbb{Z}_{RT})$ pairwise independent with $X_h \overset{d}{=} Y_{M,\gamma,\alpha}$ $\forall h < K$ and $X_K \overset{d}{=} Y_{L,\gamma,\alpha}$

(3) An exceptional set $E \subset \mathbb{Z}_{RT}$ with $\mathbb{P}(E) < \delta$

(4) $Q_x : \mathbb{Z}_T \to \mathbb{Z}^+$ with $Q_x \in \mathcal{Q}$ and $Q_x \mid T$ $\forall x$, such that for each $x \notin E$,

$$\frac{1}{|\Lambda_{Q_x}|} \sum_{a \in \Lambda_{Q_x}} f_h(x + a) \ge X_h(x) \quad \forall 1 \le h \le K \tag{4.5}$$

and

$$f_h(x + y - Q_x) = f_h(x + y) \quad \forall 1 \le h \le K, \ Q_x \le y \le \psi(A Q_x). \tag{4.6}$$

Remark In the final section of this paper, we will discuss the significance of the parameters $A$, $D$, $R$, and
$\delta$, as well as the distribution $Y_{n,\gamma,\alpha}$, the reason we require $\gamma$ and $\alpha$ to be dyadic rationals, and other points
whose necessity in the argument is not immediately obvious. For the time being, we ask the reader's trust
that these complications are required in order to make a strong enough inductive step.

Claim. Proposition 4.1 implies Theorem 3.1.



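Proof. If we fix $C > 0$, take $\gamma > 0$ small, and set $M = \lfloor C/\gamma \rfloor$ and $K = \lfloor \gamma / C\epsilon \rfloor$, and take $\delta < \frac{1}{4}$, $A = D = 1$,
then at Step $(K, M, M)$ we have

$$\mathbb{E} f_h \le (1 + 4\epsilon)^{KM} \le (1 + 4\epsilon)^{1/\epsilon} \le e^4,$$
$$\mathbb{E} X_h = \mathbb{E} Y_{M,\gamma,\alpha} \ge 1 + M \alpha^2 \gamma \ge C \alpha^2,$$
$$\mathbb{E}(X_h)^2 = \mathbb{E} Y_{M,\gamma,\alpha}^2 \le 2 (1 - \gamma)^{-M} \le 2 e^{2\gamma M} \le 2 e^{2C}.$$

Therefore, since the $X_h$ are pairwise independent and identically distributed,

$$\mathbb{P}\left( \left| \frac{X_1 + \dots + X_K}{K} \right| < \frac{C\alpha^2}{2} \right) \le \mathbb{P}\left( \left| \frac{X_1 + \dots + X_K}{K} - \mathbb{E} X_1 \right| > \frac{C\alpha^2}{2} \right)$$
$$\le \left( \frac{C\alpha^2}{2} \right)^{-2} \mathbb{E} \left| \frac{X_1 + \dots + X_K}{K} - \mathbb{E} X_1 \right|^2$$
$$\le 4 \alpha^{-4} C^{-2} K^{-1} \mathbb{E}(X_1 - \mathbb{E} X_1)^2$$
$$\le \frac{8 e^{2C}}{\alpha^4 C^2 K}$$

and for $\gamma$ sufficiently small (by (3.3), this means $K = \lfloor \gamma / C\epsilon \rfloor$ sufficiently large), this is less than $\frac{1}{4}$.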
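As a sanity check on the arithmetic of this parameter choice, the sketch below evaluates the three bounds for sample values. It assumes the reading $M = \lfloor C/\gamma \rfloor$, $K = \lfloor \gamma/(C\epsilon) \rfloor$ of the garbled source, and the numerical values of $C$, $\gamma$, $\epsilon$, $\alpha$ are purely illustrative; in the paper $\epsilon = \epsilon_\gamma$ is determined by the sequence, not chosen freely.

```python
import math
from fractions import Fraction


def step_parameters(C, gamma, eps):
    """Return (M, K) with M = floor(C / gamma), K = floor(gamma / (C * eps)).

    Inputs should be Fractions so that the floors are computed exactly.
    """
    M = math.floor(C / gamma)
    K = math.floor(gamma / (C * eps))
    return M, K


def moment_bounds(C, gamma, eps, alpha):
    """Evaluate the quantities compared in the proof, as floats."""
    M, K = step_parameters(C, gamma, eps)
    return {
        "M": M,
        "K": K,
        "mean_f_upper": float(1 + 4 * eps) ** (K * M),      # compare with e^4
        "mean_X": float(1 + M * alpha ** 2 * gamma),        # compare with C * alpha^2
        "second_moment_upper": 2 * float(1 - gamma) ** (-M),  # compare with 2 e^{2C}
    }
```

Because $KM \le 1/\epsilon$ by construction, the bound on $\mathbb{E} f_h$ never exceeds $e^4$ regardless of how large $C$ is, which is what lets the weak $L^1$ norm grow with $C$ while the $L^1$ norm stays bounded.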

Now if we consider the probability space Iirt with the measure-preserving transformation tx = x + 1, 
and set / = /1 H + fa and X = X\-\ h X K , then by J33]), (@~5|) and (gH) 



(4.7) 



1 B 

£ K x + n *)>]^l E f{x + a)>pX{x) Vx?E, 



k=l 



a£A c 



and since i}(Q x ) G S, this implies 



Therefore 



1^,, „ „ CKa 2 p Ca 2 f3 

H^P^E/ 07 - ,lfc lli,oo>^^>^ 



-F(E)>-. 



I/II 



fc=i 



Since C is arbitrary, there can be no maximal inequality. 



□ 
We will prove Proposition 4.1 by induction on $K$ and $L$. We fix $M$ and $\gamma$ at the beginning of the argument
(since they will not change as $K$ and $L$ change), and prove each Step $(K, M, L)$ for all values of $A$, $\delta$ and $D$.

Note that Step $(1, M, 0)$ is trivial, and that Step $(K - 1, M, M)$ implies Step $(K, M, 0)$: fix the parameters
$A$, $\delta$ and $D$ and obtain $\{T, R, f_1, \dots, f_{K-1}, X_1, \dots, X_{K-1}, E, Q_x\}$ satisfying (1)-(4) on $\mathbb{Z}_{RT}$. Now set $f_K = 1$,
$X_K = 1$ on $\mathbb{Z}$. This clearly satisfies the conditions.

Therefore, to prove Theorem 3.1 it suffices to show that if $L < M$ and we know Step $(K, M, L)$ and
all previous steps (for all values of $A$, $\delta$ and $D$, and for a fixed $\gamma$), we can prove Step $(K, M, L + 1)$
for a fixed $A$, $\delta$ and $D$ and the same $\gamma$. We start by applying Step $(K, M, L)$ with $A$, $\delta/4$ and $D$ to
obtain a family $\{T_L, R_L, f_1^L, \dots, f_K^L, X_1^L, \dots, X_K^L, E_L, Q_{x,L}\}$ satisfying (1)-(4); we must then construct
$\{T_{L+1}, R_{L+1}, f_1^{L+1}, \dots, f_K^{L+1}, X_1^{L+1}, \dots, X_K^{L+1}, E_{L+1}, Q_{x,L+1}\}$ to have the required properties.

5 Periodic Rearrangements 

As in [3], we will essentially construct the next functions by choosing a new $q$ and redefining the current
functions on the subset $(-\Lambda_q)^\gamma$. On this set, we will be selecting a new $Q_{x,L+1} \gg T_L$, and we want to ensure
that the left side of (4.5) will be uniformly large on a significant subset of $(-\Lambda_q)^\gamma$. Since an average of $f_h^L$
over $\Lambda_Q$ (in the sense of (4.5)) may be irregular for large $Q$, we will modify the $f_h^L$ in advance so that these
averages will be bounded below by a constant, while preserving their averages over $\Lambda_Q$ for smaller $Q$.

Following Buczolich and Mauldin, we call this modification a periodic rearrangement. Given natural numbers
$p \gg T$ with $(p, T) = 1$, we will define a linear operator $f \mapsto \tilde{f}$ from $\ell^1(\mathbb{Z}_T)$ to $\ell^1(\mathbb{Z}_{pT})$ which preserves joint
distribution of functions, such that on long blocks each $\tilde{f}$ is identical to a translate of $f$.

In our particular cases, where $\Lambda_T$ consists of the residues of $d$th powers or the integers relatively prime
to $T$, we can simply define



and prove directly that for every $x \in \mathbb{Z}_{pT}$,



(For the former case, we would use the Polya-Vinogradov inequality on character sums; for the latter we
would use Dirichlet's theorem.)

However, we lack such tools when considering more general sequences, so we shall instead use an external
randomization in our construction of $\tilde{f}$. Let $\Omega$ be a probability space and $\xi_i(\omega)$ be independent random
variables on $\Omega$, each with a uniform distribution on the discrete set $\{0, \dots, T - 1\}$. Then for any $f \in \ell^1(\mathbb{Z}_T)$
and $\omega \in \Omega$ we define (on the interval $[0, pT)$) the function




Heuristically speaking, we break $\mathbb{Z}_{pT}$ into blocks of size $T \lfloor \sqrt{p} \rfloor$ and shift each by a random variable $\xi_i(\omega)$.
The point of this is that, although we cannot directly prove that $\Lambda_{pT}$ will be equidistributed in the residue
classes modulo $T$, there must exist some value of these shifts under which this is approximately so.

Similarly, for a set $E \subset \mathbb{Z}_T$ we can define $E^\omega := \operatorname{supp} (1_E)^\omega \subset \mathbb{Z}_{pT}$. Note that $E^\omega$ contains exactly $\lfloor \sqrt{p} \rfloor |E|$
points in each full block, and $(p - \lfloor \sqrt{p} \rfloor^2) |E|$ points in the last block; thus we see that $\mathbb{P}(E^\omega) = \mathbb{P}(E)$
for all $E \subset \mathbb{Z}_T$ and all $\omega \in \Omega$, from which it follows that this periodic rearrangement preserves the joint
distribution of any collection of functions.
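The block structure described above can be illustrated in a few lines. The sketch below is an assumption-laden reading of the heuristic description, not the paper's displayed definition (which did not survive in this copy): it uses $\lfloor \sqrt{p} \rfloor$ full blocks of length $T \lfloor \sqrt{p} \rfloor$, one final block of length $T(p - \lfloor \sqrt{p} \rfloor^2)$, and within block $i$ it sets the rearranged value at $x$ to $f(x + \xi_i \bmod T)$. The function name and the treatment of the last block are hypothetical.

```python
import math
import random
from collections import Counter


def periodic_rearrangement(f, p, rng):
    """Rearrange a T-periodic function f (given as a list of length T) on Z_{pT}.

    Assumed construction: block i is shifted by an independent uniform
    xi_i in {0, ..., T-1}; the last (partial) block gets its own shift.
    """
    T = len(f)
    r = math.isqrt(p)              # floor(sqrt(p))
    block = T * r                  # length of each full block
    shifts = [rng.randrange(T) for _ in range(r + 1)]
    out = []
    for x in range(p * T):
        i = min(x // block, r)     # index r denotes the final block
        out.append(f[(x + shifts[i]) % T])
    return out
```

Since every block length is a multiple of $T$, each value of $f$ appears in a block exactly as often as in the unshifted periodic extension, so value counts on $\mathbb{Z}_{pT}$ are $p$ times those on $\mathbb{Z}_T$. Passing a list of tuples $(f_1(x), f_2(x))$ as `f` shows the same for joint distributions, because all functions share the same shifts.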

For most $x \in \mathbb{Z}_{pT}$ there exists an $x'$ such that $f^\omega(x + z) = f(x' + z)$ for $0 \le z \ll \sqrt{p}$. In particular, if
we take our original exceptional set $E \subset \mathbb{Z}_{RT}$, we can define a new exceptional set

^H-^-U |J [zTLVpJ -^(AT),iT[^p\]+pZ (5.2) 

0<i<^5-l 

such that (|4.5j) and (j4.6|) are still satisfied for f^ ,IJ , X^'^ and Q% L off of E 1 ^). Furthermore, 

P(E\u J ))<P(E) + ^l±^<5/2 
T^/p 

for p sufficiently large. 

Now we can prove the following lemma: 

Lemma 5.1. For p sufficiently large with (p,T) — 1, there exists uj G fl such that for all < / G £ x (Zt) 
and all x G 1 v t, 

1 PJ 1 a£A pT 

Proof. By the linearity of the periodic rearrangement, it suffices to prove that for some o>, this holds for all 
characteristic functions of singletons in Z^: thus it is enough to show that for all x G Z p t and b G Z^, 

Fix 6 and a;; then counting only the main blocks, 

J2 + > Yl \{ae& P T:(i-l)lVp}T<x + a<il^p}T, x + a + £i{u>)-beTZ}\ 

i 

where Pi(w) := { [{% - 1) |ypjT - x, i L^pJ T - x) {b - x - & + TZ}} is an arithmetic progression in 1 p t- 
If we then define 

Vi(u) :=p- 1 ' 2 \K pT ^P l {uo)\, 

we see that < Vi < 1, that the are independent, and that each z G A p ^ PI [(i — 1) Lv^pJ T — x,i \_y/p\ T — x) 
contributes to Vi{u>) for precisely one value of Therefore 

Var w i/i(c<;) < E^ 2 (w) < E w ^(cj). 
Recall that inf P(A Pj ) > 0, so that for p > T, \A pT \ = |A P | • |A T | > 2\/p. Thus for p sufficiently large, 



ME^M) > ^|a p t\[^-t(p-(Lv^J) 2 )^]|> 3|Apt| 



^(w)) < 
and we may apply Chernoff's Inequality (Theorem 1.8 from [10]) to find



< 



(x>*>-«^>|^) 



2eXP< - MT^p < pT 2 



for p sufficiently large (depending on T) . □ 

6 Defining 

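Using the properties (3.1)-(3.4), we now take $p = p_{i(K,L)}$ and $q = q_{j(K,L)}$ from our auxiliary sequences such
that they are relatively prime to each other and to $T_L$, $R_L$ and $D$, such that $p$ is large enough for Lemma
5.1, and such that $q \gg D$, $s_q \ge 8 \delta^{-1} \psi(AT)$ and

$$\mathbb{P}((-\Lambda_q)^\gamma) = \mathbb{P}(\Lambda_q^\gamma) \ge \gamma - \epsilon,$$
$$|\{(u, v) \in \Lambda_q \times \Lambda_q : |u - v - w| \ge \gamma s_q \ \forall w \in \Lambda_q\}| \ge 5\alpha |\Lambda_q|^2.$$

We would like to define $f_K^{L+1}$ and $X_K^{L+1}$ as in (2.2) and (2.3), but then $X_K^{L+1}$ will not precisely equal $Y_{L+1,\gamma,\alpha}$
in distribution. This is the reason we will make the $X_h^{L+1}$ periodic by $R_{L+1} T_{L+1}$ rather than just $T_{L+1}$: we
will later multiply the parts of $X_K^{L+1}$ by the characteristic function of intervals whose lengths are appropriate
multiples of $T_{L+1}$. It will be essential (for its use in later inductive steps) that we keep $(R_{L+1}, D) = 1$, and
for this we will need to define $f_K^{L+1}$ and $X_K^{L+1}$ in a more complicated fashion.

First, we will let

$$\Phi_q := \{u \in -\Lambda_q : |\{v \in \Lambda_q : u + v \notin -\Lambda_q + (-\gamma s_q, \gamma s_q)\}| \ge 2\alpha |\Lambda_q|\} \tag{6.1}$$

and note that $|\Phi_q| \ge 3\alpha |\Lambda_q|$: of the at least $5\alpha |\Lambda_q|^2$ pairs counted above, fewer than $2\alpha |\Lambda_q|^2$ can have $u \notin \Phi_q$, and
each $u \in \Phi_q$ accounts for at most $|\Lambda_q|$ pairs. We then consider $\Phi_q^\gamma = \Phi_q + (0, s_q \gamma) \subset (-\Lambda_q)^\gamma$. Now $\mathbb{P}(\Phi_q^\gamma) \ge 2\alpha\gamma$ since

$$\gamma - \epsilon \le \mathbb{P}((-\Lambda_q)^\gamma) \le \mathbb{P}(\Phi_q^\gamma) + q^{-1} \gamma s_q |{-\Lambda_q} \setminus \Phi_q| \le \mathbb{P}(\Phi_q^\gamma) + \gamma (1 - 3\alpha)$$

and we have stipulated that $\epsilon < \alpha\gamma$.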

Now we choose two sets ^ q , A q C Z 9 such that 

C $^ (6.2) 

P(* 9 ) > 07 (6.3) 

Z, \ (-A q )-r CA,CZ,\$, (6.4) 

P(A 9 ) < 1 - P(A^) + 5/8 (6.5) 

(|*,|, J D) = (|A g |,i?) = l. (6.6) 

(As q^> D, this last condition is clearly possible to satisfy simultaneously with the others.) We observe that 

\{v G A q : x + v G AJ| > 2a|A 9 | Vie$„ (6.7) 



ff 



and accordingly we define 



fk +1 ■= (i-7)-7^iA e G^(z 



qpT) 



Note that (q,pT) = 1 implies E/£ +1 = (1-7)" 1 P(A 9 )E/^ = (1 - 7)- 1 P(A 9 )E/^; since 7 < 1/2 and (5 < 8e, 
this means 

E/£<E/a- +1 <(i + 4£)E/£. (6 . 8) 
The goal of Lemma 15.11 and the definition of ^ q is the following lemma: 

Lemma 6.1. Let T = T^,, wzt/i p and g chosen as above. For all x G "fg and /or any Q 6 Q smc/j i/iai 
gpT I Q, 

TT-f E + > «■ 

at*. 

Proof. Let £? := Since Q G Q is squarefree, we sec that B, T,p and q arc pairwise relatively prime, and 

E E( 1 -^ _1 ^(^ + ")lA g ( a : + «) 



>- ^ E /£(*+«) 

> aE/£ > a 

using (|6.7p for the first inequality and Lemma ISTTl for the second. □ 
We also define an additional exceptional set 

El := {x G A q : 30 < y < tp{AQ XiL ) such that x + y g A q }. (6.9) 
Note that £| C {A q n (-A 9 ) 7 } U {-A g + (-^(AT), 0]}. By our choice of q, we see that 

F(El)<S/8+\A q \ij(AT)< 5 -. 

7 Restricting a Family to (— A g ) 7 

As noted in the heuristic outline, if we wish to change $Q_x$ on the set $(-\Lambda_q)^\gamma$, we must change $f_1^L, \dots, f_{K-1}^L$
and $X_1^L, \dots, X_{K-1}^L$ as well, since these need no longer satisfy (4) with the new value of $Q_x$. In order
to find suitable replacement functions on $(-\Lambda_q)^\gamma$, we will take a Step $(K - 1, M, M)$ family
$\{S, R', g_1, \dots, g_{K-1}, Z_1, \dots, Z_{K-1}, E', Q'_x\}$ with suitable parameters, and then restrict the functions $g_h$ to
the set $(0, \gamma s_q) + q\mathbb{Z}$ (multiplying them by $|\Lambda_q|$ so that their $\ell^1$ norm is $\gamma$ times its previous value). The
averages along $\Lambda_q$ starting at any $x \in (-\Lambda_q)^\gamma$ will sample from this set uniformly often, so that we may be
able to preserve property (4) there.

This restriction will not preserve the independence of the $Z_h$ as such, since the averages of the restricted
$g_h$ must be 0 off of $(-\Lambda_q)^\gamma$. However, we may define the conditional expectations $\mathbb{E}(X \mid E)$; and these will
remain independent, for $E \subset (-\Lambda_q)^\gamma$.

Definition Let $X$ be a random variable on a discrete probability space $(\Omega, \mathbb{P})$, and let $E \subset \Omega$ with
$\mathbb{P}(E) > 0$. We define the conditional expectation $\mathbb{E}(X \mid E)$ to be the random variable on $E$ with
$\mathbb{E}(X \mid E)(x) = X(x)$ $\forall x \in E$, where $E$ is equipped with the probability measure $\mathbb{P}_E(x) = \mathbb{P}(E)^{-1} \mathbb{P}(x)$.

Now we can state the actual form of this restriction for an entire Step (K, M, L) family: 

Lemma 7.1. Let {T, R, /i, . . . , fx, X\, . . . , ATr- , E, Q x } satisfy the properties of Step (K, M, L) for the pa- 
rameters A, S, D. Say we have q, B G Q with g, B and T pairwise relatively prime, and qB < A. Let 



"9 


:= {0,-ys q )+qZ 


h{x) 


:= \A q \f h (x)l sl (x) 


X h (x) 


:= X h {x)l { _ AqV {x) 


E 


:= En(-A q r 


Qx 


■= qBQ x 


A 


A 

qB 


D 


D 


(D, qB) 


T 


:= qBT. 



Then (T, D) = 1 and 

(1) A, . . . , f K e with f h >0 and Ef h < jEf h 

(2) Xi, . . . ,Xk G i l CL R rp) such that for any nonempty q-periodic E C (— A 9 ) 7 , E(A\ | E), . . . ,E(Aa- | E) 

are independent and E(X^ | E) = Xf L - 

(3) E C If with P(E) < <5P((-A ? ) 7 ) 

(4) Q x ■ %t ~ * ^ such that Q x \ T Va* G and /or eac/i x £ E, 

1 



E /h(a: + o)>X h (x) Vl</i</v 



aeA f ; 



A(x + y - Q*) = / h (x + 2/) VI < h < K, Q x < y < ip(AQ x ). 



Proof. Since (q,T) = 1, any q-periodic set contains an equal portion of integers from each residue class 
modulo T. This fact quickly implies properties (l)-(3), noting that the joint distribution of the Xh on 
any g-periodic E C (— A g ) 7 is the same as their joint distribution on Iirt- Now for property (4), take 
x G (— A 9 ) 7 \ E (since it is trivial otherwise). Since Q x , q, and B are relatively prime, 



|A ( 



Y h(x + a) = 



aeAr 



|Ai 



E 



|A,q. 



Y fh(x + z) 



zeA„ 



t^i E \A q \l s ^(x + v) ■ j^— Y fh(x + w) 



> 



^ 1 «GA 
1 



since if .t G (— A g ) 7 , there must exist some o€ A, such that x ■ 
Finally, AQ X = AQ X so (|4.6|) implies the last claim trivially. 



v G El 



□ 
8 Completion of the Inductive Step

Now we are ready to define the other functions and prove Step $(K, M, L + 1)$; but since so much goes into
this step, we will show the origins of the various pieces.

We began with $\{T_L, R_L, f_1^L, \dots, f_K^L, X_1^L, \dots, X_K^L, E_L, Q_{x,L}\}$ satisfying (1)-(4) with the parameters $A$, $\delta/4$,
and $D$. We modify these objects in several ways, using a new $p = p_{i(K,L)}$ and $q = q_{j(K,L)}$ chosen in Section 6.

First, we applied the $p$-periodic rearrangement $f \mapsto f^{\omega}$ defined in (5.1) to the functions $f_h^L$ and $X_h^L$,
with $\omega$ chosen as in Lemma 5.1, and defined an associated exceptional set $E_L^1(\omega)$ in (5.2). We defined sets
$\Lambda_q, \Psi_q \subset \mathbb{Z}_q$ satisfying (6.2)-(6.7); then we defined the function $f_K^{L+1}$ in (6.8) and an additional exceptional
set $E_L^2$.



We are now ready to proceed. 



Set $A_L := A T_L p q$ and $D_L := D T_L p q$. By our strong inductive hypothesis, we may assume Step $(K-1, M, M)$
and thus construct a family $\{S, R', g_1, \ldots, g_{K-1}, Z_1, \ldots, Z_{K-1}, E', Q'_x\}$ satisfying (1)-(4) with parameters
$A_L$, $\delta/4$, $D_L$. (In particular, $S$ and $R'$ are each relatively prime to $T_L$, $p$, $q$, and $D$.)

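Applying Lemma 7.1 with $B = T_L p$, we obtain $\{\tilde S, R', \tilde g_1, \ldots, \tilde g_{K-1}, \tilde Z_1, \ldots, \tilde Z_{K-1}, \tilde E', \tilde Q'_x\}$ on $\mathbb{Z}\setminus\Lambda_q$
satisfying (1)-(4) with the parameters $\tilde A_L = A$, $\delta/4$, and $\tilde D_L = D$.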



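Then we define

\[ T_{L+1} := \tilde S = S T_L p q \tag{8.1} \]

\[ E_{L+1} := E_L^1(\omega) \cup E_L^2 \cup \tilde E' \tag{8.2} \]

\[ Q_{x,L+1} := \begin{cases} Q_{x,L}, & x \in \Lambda_q \\ \tilde Q'_x, & x \notin \Lambda_q \end{cases} \tag{8.3} \]

and for $1 \le h \le K-1$,

\[ f_h^{L+1} := f_h^{L,\omega}\,\mathbf{1}_{\Lambda_q} + \tilde g_h\,\mathbf{1}_{\mathbb{Z}\setminus\Lambda_q} \in \ell^1(\mathbb{Z}_{T_{L+1}}) \tag{8.4} \]

\[ X_h^{L+1} := X_h^{L,\omega}\,\mathbf{1}_{\Lambda_q} + \tilde Z_h\,\mathbf{1}_{\mathbb{Z}\setminus\Lambda_q} \in \ell^1(\mathbb{Z}_{T_{L+1}}). \tag{8.5} \]

We have already defined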



Ik +1 ■■= (l-7)-7j&lA,G^(Zii + i)- 

It thus remains to define $X_K^{L+1}$. As noted before, we cannot simply define it as in (2.3); we must reduce it
slightly so that it equals $Y_{L+1,\gamma,\alpha}$ in distribution.

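Recall that $\mathbb{P}(\Lambda_q) \ge 1 - \gamma$ and $\mathbb{P}(\Psi_q) \ge \alpha\gamma$; we have taken $\gamma, \alpha$ to be dyadic rationals and assumed
(6.6), so we may write

\[ \frac{1-\gamma}{\mathbb{P}(\Lambda_q)} = \frac{s}{r}, \qquad \frac{\alpha\gamma}{\mathbb{P}(\Psi_q)} = \frac{t}{r} \tag{8.6} \]

with $(r, D) = 1$. (Recall that $D$ is odd.) Everything so far is periodic with period $T_{L+1}R_LR'$, so if we define

\[ R_{L+1} := R_L R' r \]

\[ \Gamma_s := [0, sR'R_LT_{L+1}) + R_{L+1}T_{L+1}\mathbb{Z} \]

\[ \Gamma_t := [0, tR'R_LT_{L+1}) + R_{L+1}T_{L+1}\mathbb{Z} \]

\[ X_K^{L+1} := (1-\gamma)^{-1} X_K^{L,\omega}\,\mathbf{1}_{\Lambda_q}\mathbf{1}_{\Gamma_s} + \alpha\,\mathbf{1}_{\Psi_q}\mathbf{1}_{\Gamma_t}, \]

then $X_K^{L+1} \overset{d}{=} Y_{L+1,\gamma,\alpha}$ and all of the $X_h^{L+1}$ are periodic with period $R_{L+1}T_{L+1}$.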
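To see why truncating by a set of the form $[0, sN) + rN\mathbb{Z}$ rescales probabilities by exactly $s/r$, the following Python sketch may help. It is only an illustration under assumed values: `base_period` stands in for the common period written $R'R_LT_{L+1}$ in the text, and the numbers `s`, `r`, `base_period` as well as the helper names are hypothetical.

```python
from fractions import Fraction

def gamma_set(s, r, base_period):
    """Members in [0, r*base_period) of the set
    [0, s*base_period) + (r*base_period)*Z, i.e. one full period of it."""
    full_period = r * base_period
    return [x for x in range(full_period) if x < s * base_period]

def class_counts(members, modulus):
    """How many members fall in each residue class modulo `modulus`."""
    counts = [0] * modulus
    for x in members:
        counts[x % modulus] += 1
    return counts

s, r, base_period = 3, 8, 6          # hypothetical values with s <= r
members = gamma_set(s, r, base_period)

# Density of the set inside Z, computed over one full period.
gamma_density = Fraction(len(members), r * base_period)

# Share of each residue class modulo the base period.
per_class = class_counts(members, base_period)

print(gamma_density, per_class)
```

Because every residue class modulo the base period receives the same number of members, intersecting a function of that period with such a set multiplies each of its level-set probabilities by the same factor $s/r$.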

We now have a family $\{T_{L+1}, R_{L+1}, f_1^{L+1}, \ldots, f_K^{L+1}, X_1^{L+1}, \ldots, X_K^{L+1}, E_{L+1}, Q_{x,L+1}\}$, with $T_{L+1} \in \mathcal{Q}$ and

$(T_{L+1}, D) = (R_{L+1}, D) = 1$. We must check the four properties of Step $(K, M, L+1)$:

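(1) $f_1^{L+1}, \ldots, f_K^{L+1} \in \ell^1(\mathbb{Z}_{T_{L+1}})$ with $f_h^{L+1} \ge 0$ and $1 \le \mathbb{E} f_h^{L+1} \le (1+4\varepsilon)^{(K-1)M+L+1}$.

Proof. For $1 \le h \le K-1$, by the inductive hypothesis we see

\[ \mathbb{E} f_h^{L+1} = \mathbb{E} f_h^{L}\,\mathbb{P}(\Lambda_q) + \mathbb{E}\tilde g_h \]

\[ \le (1 - \gamma + \varepsilon)\,\mathbb{E} f_h^{L} + \gamma\,\mathbb{E} g_h \]

\[ \le (1+\varepsilon)(1+4\varepsilon)^{(K-1)M+L} \]

\[ \le (1+4\varepsilon)^{(K-1)M+L+1}. \]

For $h = K$ this follows from (6.8) and the inductive hypothesis. □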

(2) $X_1^{L+1}, \ldots, X_K^{L+1} \in \ell^1(\mathbb{Z}_{R_{L+1}T_{L+1}})$ pairwise independent with $X_h^{L+1} \overset{d}{=} Y_{M,\gamma,\alpha}$ for $1 \le h \le K-1$, and $X_K^{L+1} \overset{d}{=} Y_{L+1,\gamma,\alpha}$.

Proof. We have inductively assumed that $X_1^L, \ldots, X_K^L$ are pairwise independent; since the periodic
rearrangement preserves joint distribution, this is true of $X_1^{L,\omega}, \ldots, X_K^{L,\omega}$ as well. We begin by considering
the conditional expectations of the $X_h^{L+1}$ on $\Lambda_q$. Recall that $q$ is relatively prime to $p$, $T_L$, $R_L$, $S$, and
$R'$, so that $\Lambda_q$ contains an equal proportion of all residue classes modulo $pT_LR_LSR'$. Furthermore, $\Gamma_s$
and $\Gamma_t$ each contain an equal proportion of all residue classes modulo $R'R_LT_{L+1}$. Thus for any $h < K$
and $\lambda_h, \lambda_K > 0$,

F(x e A q : Xf+\x) > X h ,X^ +1 (x) > X K ) = ¥{x g A q nT 3 ,X^(x) > X h ,X K (x) > X K ) 

= ^P(A q )¥(Xt(x) > X h )F(X K (x) > X K ) 
= V(A q )P(X^ +1 {x) > X h )nX K +1 (x) > X K ). 

Thus $\mathbb{E}(X_h^{L+1} \mid \Lambda_q)$ and $\mathbb{E}(X_K^{L+1} \mid \Lambda_q)$ are independent; similarly, $\mathbb{E}(X_h^{L+1} \mid \Lambda_q)$ and $\mathbb{E}(X_{h'}^{L+1} \mid \Lambda_q)$
are independent for any $h < h' < K$.

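We proceed similarly on the rest of $\mathbb{Z}$, letting $\Xi$ denote either $\Psi_q$ or $\mathbb{Z}_{T_{L+1}} \setminus (\Lambda_q \cup \Psi_q)$; on each
of these, Lemma 7.1 implies that $\mathbb{E}(\tilde Z_1 \mid \Xi), \ldots, \mathbb{E}(\tilde Z_{K-1} \mid \Xi)$ are pairwise independent and distributed
like $Y_{M,\gamma,\alpha}$. Clearly $\mathbb{E}(\mathbf{1}_{\Gamma_t} \mid \Xi)$ is independent of any of these, so $\mathbb{E}(X_1^{L+1} \mid \Xi), \ldots, \mathbb{E}(X_K^{L+1} \mid \Xi)$ are
pairwise independent for $\Xi = \Psi_q$ or $\mathbb{Z}_{T_{L+1}} \setminus (\Lambda_q \cup \Psi_q)$.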

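Now for each $1 \le h < K$,

\[ \mathbb{E}(X_h^{L+1} \mid \Lambda_q) \overset{d}{=} \mathbb{E}(X_h^{L+1} \mid \Psi_q) \overset{d}{=} \mathbb{E}\bigl(X_h^{L+1} \mid \mathbb{Z}_{T_{L+1}} \setminus (\Lambda_q \cup \Psi_q)\bigr) \overset{d}{=} Y_{M,\gamma,\alpha}, \tag{8.7} \]

and so for any $1 \le h < h' \le K$ and $\lambda_h, \lambda_{h'} > 0$,

\[ \mathbb{P}\bigl(X_h^{L+1}(x) > \lambda_h,\ X_{h'}^{L+1}(x) > \lambda_{h'}\bigr) = \sum_{\Xi} \mathbb{P}\bigl(x \in \Xi : X_h^{L+1}(x) > \lambda_h,\ X_{h'}^{L+1}(x) > \lambda_{h'}\bigr) \]

\[ = \sum_{\Xi} \mathbb{P}\bigl(\mathbb{E}(X_h^{L+1} \mid \Xi)(x) > \lambda_h,\ \mathbb{E}(X_{h'}^{L+1} \mid \Xi)(x) > \lambda_{h'}\bigr)\,\mathbb{P}(\Xi) \]

\[ = \sum_{\Xi} \mathbb{P}\bigl(\mathbb{E}(X_h^{L+1} \mid \Xi)(x) > \lambda_h\bigr)\,\mathbb{P}\bigl(\mathbb{E}(X_{h'}^{L+1} \mid \Xi)(x) > \lambda_{h'}\bigr)\,\mathbb{P}(\Xi) \]

\[ = \sum_{\Xi} \mathbb{P}\bigl(X_h^{L+1}(x) > \lambda_h\bigr)\,\mathbb{P}\bigl(\mathbb{E}(X_{h'}^{L+1} \mid \Xi)(x) > \lambda_{h'}\bigr)\,\mathbb{P}(\Xi) \]

\[ = \mathbb{P}\bigl(X_h^{L+1}(x) > \lambda_h\bigr) \sum_{\Xi} \mathbb{P}\bigl(x \in \Xi : X_{h'}^{L+1}(x) > \lambda_{h'}\bigr) \]

\[ = \mathbb{P}\bigl(X_h^{L+1}(x) > \lambda_h\bigr)\,\mathbb{P}\bigl(X_{h'}^{L+1}(x) > \lambda_{h'}\bigr). \]

(The sum is over the sets $\Xi = \Lambda_q$, $\Psi_q$, and $\mathbb{Z}_{T_{L+1}} \setminus (\Lambda_q \cup \Psi_q)$; the property (8.7) enters in at the fourth
equality.) Thus we have preserved independence.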

We have already noted that $X_K^{L+1} \overset{d}{=} Y_{L+1,\gamma,\alpha}$. □
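The role of an exact common distribution in the argument above can be illustrated on a toy finite probability space. The Python sketch below is not the construction of this section: the cell weights and laws are made-up numbers and the helper names are hypothetical. It glues together cells on which two variables are conditionally independent, and checks that the glued pair is independent when the first variable has the same conditional law on every cell, but can fail to be independent when that law varies from cell to cell.

```python
from fractions import Fraction as F
from itertools import product

def joint_from_cells(cells):
    """cells: list of (weight, law_x, law_y), where the laws are dicts
    value -> probability and X, Y are independent inside each cell.
    Returns the joint law of (X, Y) on the whole space."""
    joint = {}
    for weight, law_x, law_y in cells:
        for (x, px), (y, py) in product(law_x.items(), law_y.items()):
            joint[(x, y)] = joint.get((x, y), 0) + weight * px * py
    return joint

def is_independent(joint):
    """Exact test: joint probability equals product of marginals everywhere."""
    marg_x, marg_y = {}, {}
    for (x, y), p in joint.items():
        marg_x[x] = marg_x.get(x, 0) + p
        marg_y[y] = marg_y.get(y, 0) + p
    return all(joint.get((x, y), 0) == marg_x[x] * marg_y[y]
               for x in marg_x for y in marg_y)

# Same conditional law for X on every cell; the law of Y may vary.
same_law = {0: F(1, 2), 1: F(1, 2)}
cells_same = [
    (F(3, 4), same_law, {0: F(1, 3), 2: F(2, 3)}),
    (F(1, 4), same_law, {0: F(1)}),
]

# Conditional law of X differs between cells: X and Y become correlated.
cells_diff = [
    (F(1, 2), {0: F(1)}, {0: F(1)}),
    (F(1, 2), {1: F(1)}, {1: F(1)}),
]

independent_same = is_independent(joint_from_cells(cells_same))
independent_diff = is_independent(joint_from_cells(cells_diff))
print(independent_same, independent_diff)
```

Exact rational arithmetic is used so that the equality test is not affected by floating-point rounding.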

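(3) An exceptional set $E_{L+1} \subset \mathbb{Z}_{T_{L+1}}$ with $\mathbb{P}(E_{L+1}) < \delta$.

Proof. $\mathbb{P}(E_{L+1}) \le \mathbb{P}(E_L^1(\omega)) + \mathbb{P}(E_L^2) + \mathbb{P}(\tilde E') \le \delta/2 + \delta/4 + \delta\gamma/4 < \delta$. □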

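(4) $Q_x = Q_{x,L+1} : \mathbb{Z}_{T_{L+1}} \to \mathbb{Z}^+$ with $Q_x \in \mathcal{Q}$ and $Q_x \mid T_{L+1}$ $\forall x$, such that for each $x \notin E_{L+1}$,

\[ \frac{1}{|A_{Q_x}|} \sum_{a \in A_{Q_x}} f_h^{L+1}(x+a) \ge X_h^{L+1}(x) \quad \forall\, 1 \le h \le K \]

and

\[ f_h^{L+1}(x + y - Q_x) = f_h^{L+1}(x+y) \quad \forall\, 1 \le h \le K,\ Q_x \le y \le \psi(AQ_x). \]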

Proof. Since $Q_{x,L} \mid T_L$ and $Q'_x \mid S$, clearly $Q_{x,L+1} \mid T_{L+1}$; in addition, $Q_{x,L} \in \mathcal{Q}$ and $\tilde Q'_x = pqT_LQ'_x \in \mathcal{Q}$
(this is squarefree since $p$, $q$, $T_L$, and $Q'_x$ are pairwise relatively prime).

For $x \in \Lambda_q \setminus E_{L+1}$ and $1 \le y \le \psi(AT_L)$, we see that

\[ f_h^{L+1}(x+y) = f_h^{L,\omega}(x+y) = f_h^{L}(x + y + i\omega) \]

for some $\omega$ fixed and $i$ depending only on $x$. Thus (4) follows from the previous step.

On $\Psi_q$ and $\mathbb{Z} \setminus (\Lambda_q \cup \Psi_q)$, since $pqT_L \mid \tilde Q'_x$, this is just Lemma 7.1 for $h < K$, and for $h = K$
this is Lemma 6.1 combined with the observation that $f_K^{L+1}$ is periodic by $pqT_L$. □
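The squarefreeness step in the proof of (4) rests on an elementary fact: a product of squarefree integers that are pairwise relatively prime is again squarefree, while the conclusion can fail once two factors share a prime. The short Python check below is a sketch with hypothetical numbers standing in for the factors; the helper names are not from the text.

```python
from itertools import combinations
from math import gcd, prod

def is_squarefree(n):
    """True if no square d*d with d >= 2 divides the positive integer n."""
    d = 2
    while d * d <= n:
        if n % (d * d) == 0:
            return False
        d += 1
    return True

def pairwise_coprime(nums):
    """True if every pair of entries has greatest common divisor 1."""
    return all(gcd(a, b) == 1 for a, b in combinations(nums, 2))

# Squarefree and pairwise coprime (hypothetical stand-ins for the factors).
factors = [5, 7, 6, 11]
coprime_case = (pairwise_coprime(factors), is_squarefree(prod(factors)))

# Squarefree factors sharing the prime 2: the product 60 is divisible by 4.
overlapping = [6, 10]
overlap_case = (pairwise_coprime(overlapping), is_squarefree(prod(overlapping)))

print(coprime_case, overlap_case)
```

This is why the coprimality conditions carried through the induction matter for keeping the moduli inside the squarefree class.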

Thus we have proved Step $(K, M, L+1)$; by induction, we have proved Proposition 4.1 and Theorem 3.1.
9 Notes on the Proof



Several of the conditions, parameters and lemmas in this complicated argument appear on a first reading to 
be extraneous to the proof. In the interest of clarity, we find it helpful to outline in hindsight the purposes 
of the following: 

• The condition (3.3) lets us prove Theorem 3.1 from Proposition 4.1 (note that this proof requires
7/Ce 7 — > oo as 7 — > 0). (|3.4[) comes in at (|6.1[) and the subsequent definition of ^> q , and (|3.5[) allows 
us to claim (4.7).

• Prescribing an exact distribution $Y_{n,\gamma,\alpha}$ (defined in (4.4)) for the $X_h^L$ is necessary in order to guarantee
(8.7), which ensures that pairwise independence is preserved.

• $\gamma$ and $\alpha$ must be dyadic rationals, and $\Lambda_q$ and $\Psi_q$ must be chosen to satisfy (6.6), so that we can
assume $(r, D) = 1$ in (8.6), so that we can have $(R, D) = 1$ in Step $(K, M, L)$, so that we can choose
our $(K-1, M, M)$ family with $(R', q) = 1$, so that $X_K^{L+1}$ will be independent of the other $X_h^{L+1}$.

• The parameter $D$ lets us guarantee that when we inductively introduce a Step $(K-1, M, M)$ family,
we can ensure that its period $R'S$ is relatively prime to $T_L$, $p$, and $q$, thus allowing us to apply Lemma 7.1.

• We have two distinct parameters $T$ and $R$ because we will need the period of $f_h^L$ to be squarefree in
Lemma 6.1 (because the result of [7] only applies for squarefree moduli), but the operation of reducing
$X_K^{L+1}$ to its proper distribution will multiply its period by a large power of 2.

• The condition (4.6) is necessary for (4.7), connecting the actual averages over the sequence $\{n_k\}$ with
the averages over a set of residues $A_Q \subset \mathbb{Z}_Q$. The parameter $A$ must be allowed to take arbitrarily
large values, although it need only be $\ge 1$ when used in (4.7), because each application of Lemma 7.1
divides it by a large constant.

• The periodic rearrangement defined in (5.1) puts a uniform lower bound on the averages of $f_K^{L,\omega}$ over
$Q_{x,L+1}$ on a set which depends only on $q$ and not on $f_K^L$; this allows us to prove Lemma 6.1.

The author thanks his dissertation advisor, M. Christ, for several key suggestions including the external 
randomization, and M. Wierdl for many encouraging and stimulating discussions on this topic. 